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Abstract 

Ant Colony Optimization (ACO) has time complexity 0(tmN 2 ), and its typical application is 
to solve Traveling Salesman Problem (TSP), where t, m, and N denotes the iteration number, 
number of ants, number of cities respectively. Cutting down running time is one of study focuses, 
and one way is to decrease parameter t and N, especially N. For this focus, the following method is 
presented in this paper. Firstly, design a novel clustering algorithm named Special Local Clustering 
algorithm (SLC), then apply it to classify all cities into compact classes, where compact class is the 
' class that all cities in this class cluster tightly in a small region. Secondly, let ACO act on every 

z 

class to get a local TSP route. Thirdly, all local TSP routes are jointed to form solution. Fourthly, 
the inaccuracy of solution caused by clustering is eliminated. Simulation shows that the presented 
method improves the running speed of ACO by 200 factors at least. And this high speed is benefit 
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from two factors. One is that class has small size and parameter N is cut down. The route length 
at every iteration step is convergent when ACO acts on compact class. The other factor is that, 



using the convergence of route length as termination criterion of ACO and parameter t is cut down 
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I. INTRODUCTION 



A. Introduction of Ant Colony Optimization (ACO) 



In 1991, Ant Colony Optimization (ACO) was presented firstly by M. Drigo[7]and applied 



;o[J 



111 ] . Drigo et al. create a new research topic 



to solve TSP firstly by V. Mahiezzo et al. 
which is studied by many scholars now. 

ACO is essentially a system based on agents that simulate the natural behavior of ants, in 
which real ants are able to find the shortest route from a food source to their nest, without 
using visual cues by exploiting pheromone information[lo| . Pheromone is deposited when 
ants walking on a route. It provides heuristic information for other ants to choose their 
routes. The more dense the pheromone trail of a route is, the more possibly the route is 
selected by ants. At last, nearly all ants select the route that has the most dense pheromone 
trail, and it's the shortest route potentially. 



ACO has 
TSP0,Q. 



3een applied to solve optimization problems widely and successfully, such as 
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121 1. quadratic assignment problem 
□ 

classification or clustering analysis [20], biology 




image processing 



291 ] , data mining 



34|, 



The application of ACO leads the theo- 



retic study of ACO. Gutijahr firstly analyzes the convergence property of ACO 



16J. Stutzle 



& Dorigo prove the important conclusion, that if the running time of ACO is long enough, 



ACO can find optimal solution possibly 



39J. The other interesting property is revealed cur- 



rently by Birattari and et al. that the sequence of solutions of some algorithms does not 
depend on the scale of problem instance [l|. 

Running time is too long and the quality of solution is still low, that are the two main 
problems of ACO. To solve the main problems, the configuration of the parameters is 



discussed [lo|, lll|]. The method of adaptation is used to improve ACO 
putation and other method are used to accelerate ACO {4]. 



9]. Parallel com- 



B. Clustering Correlates to the Running Time of ACO 

One of study focus of ACO is to cut down running time. The running time of ACO 
is 0(t max MA 2 ), and M = [N/ 1.5] in general, where t max , M and N denote the iteration 



number, number of ants, and number of cities respectively 12]. The running time is pro- 
portional to N 2 . Cutting down the number of cities N is the key to reduce running time. 



Therefore, classifying all cities into different classes and letting ACO act on each class will 
reduce running time heavily. Hu and Huang used this method to improve the running speed 
of ACO 18J, which named ACO-K- Means. It's faster than ACO by factors of 5-15 approxi- 



mately. Simulations show that ACO-K-Means algorithm is valid only to the set of cities that 
has evident clustering feature, and invalid to more general situation. ACO-K-Means implies 
that it is possible that using clustering method to improve the running speed of ACO. 



C. Introduction of Local Clustering Algorithm 

Clustering is the classification of objects of a set (named training set) into different classes 
(or groups), so that the data in each class (ideally) share some common trait. One of most 



popular clustering algorithm is K-Means Clustering Algorithm 
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3l| . K-Means Clustering 



Algorithm assigns each point to the cluster whose center (i.e., centroid) is nearest to it, then 



|3l|. 



update the centroid. Repeat this process until termination criterion is satisfied 

During the t — th iteration of K-Means algorithm, the i — th class has distortion that is 
defined as the average distance of each point and the class centroid, which is denoted by D*f* 
(1 < i < m), where m is the number of classes. Pang proves that for each i, the distortion 



sequence {Df } is convergent if the i — th class is separated from other classes evidently 



3l|. 



That is, distortion sequence is convergent locally. According to this property, an algorithm 



named local clustering algorithm (LC) is presented 32| . its essential idea is introduced 
as below. 

Stepl. K-Means is applied to a given training set to generate classes. 
Step2. The class which distortion Df' is convergent firstly is deleted from training set. 
Then update training set such that it is comprised by residual points. Go to stepl. 
Stop the process of stepl-2 until all data is classified. 

LC algorithm is faster than K-Means algorithm by factors of 4-13 approximately. 
Suppose the i — th class is nf 1 during the t — th iteration of K-Means algorithm. Set 

has entropy H(Rf^), where H{Rf 1 ) = — J2p(a) log 2 p(a) and p(a) is the probability of data 

«efi. (<) 



a. It is proved that entropy sequence {H(Rf^), H(R,f^), ■ ■ ■ , H(Rf^), • • • } is convergent [31 ] . 
That is, the convergent criterion of K-Means algorithm can be replaced by the convergence 
of entropy sequence [33j. The K-Means with convergent criterion of entropy convergence is 



fast by two of factors at least 
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33]. 



II. IMPROVE LOCAL CLUSTERING ALGORITHM TO GENERATE COMPACT 
CLASS 



A. Compact Set and The Method of Generation 

For any subset of Euclidean space R n , every sequence in this subset has a convergent 
subsequence, the limit point of which belongs to the set. This subset is called compact set. 
The conception of compact set (or compactness) is a topology conception. To understand it 
easily, compactness can be described visually as the phenomenon that many points cluster 
tightly in a small region, while non-compact set is the set that most of points clustering 
loosely in a big region. 

K-Means Clustering, LC or other algorithm aim to partition a training set into classes. 
Some classes are compact and some are not. The most common situation is that a class 
contains a compact subset and some loose points, and the compact subset is around the 
center of the class. That is, the central part of class is compact possibly. To extract 
compact subset from a class, the following 35-principle is introduced. 

For Gauss distribution, suppose that 5 denotes the deviation of random data. It is the 
3<5-principle that it's more than 99% probability that a random point falling into the central 



region of data set which radius is 3<5 3l|. The central region contains points more than 99%. 
Thus, if radius 35 is small enough and the number of points is big enough, the central region 
is compact. Even the central region with radius 35 is not compact, shortening the radius 
of central region to ^, If; and so on will make it compact. For Gauss distribution which 
is comprised by enough points, the compact central region always exists. In general, for a 
class generated by clustering algorithm, all distances of points and class centroid comprise 
a gauss distribution approximatively. Therefore, the central region of a class is compact 
possibly. 

Suppose the i — th class is Rf' at the t — th iteration of K-Means or LC algorithm. With 
the increase of iteration, class sequence {R\ \ R^\ • ■ ■ R\ f , Rf • • • } (1 < % < m) appears, 
where m denotes the number of classes. Let 
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, where 



denotes the number of elements in R\ and d(x, c\ ) denotes distance. 
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Clearly, is the distortion of class R\ l> and 5 J - t ' 1 is the approximation of deviation of 
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(t) 



(2) 



Kf = {x | d(x, cf ) < -(A W + 35?>),xeR?>(p > 0)} 
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(3) 



is the central region of class Rf\ Parameter p is used to shorten the radius of 
central region Kf' and makes it compact. Figfj] illustrates the 35-principle and compact 



subset K, 



(t) 




FIG. 1: The illustration of compact central region of a class: In a class R.f\ most of points cluster 
around their centroid and few points are far away from the centroid. Subset iO (i.e., the shadow 
part) is the central region of class R^ . Compact set is the set that many points cluster in a small 
region tightly. Increasing parameter p will shorten radius and make K± compact. 



B. Subroutine 1: Local Clustering Algorithm with 35-Principle 

The local clustering algorithm with 35-principle is used to classify points into classes and 
to extract compact central region of classes. Its essential idea is described as below. 



Firstly, apply LC algorithm to cluster data. And apply the criterion of entropy conver- 



gence (i.e., J — — — — ^ — ► 0) to mark the stable class R® 
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Secondly, Extract compact central region K\ from class R\ and preserve it as a genuine 
class. Remove Flf from training set and update it. Repeat above two steps until all compact 
central regions are extracted. The detail is described as below. 

Input parameters: 

T: Training Set 

m: The number of class 

£: The stop threshold for clustering. 

c\ (1 < % < m): Initial centroids set. 

p: A parameter to adjust the size of compact subsets iQ (p > 0) . 

Output: 

(f(T) = {Ki \ ■ • • , Kf\ ■ ■ ■ Km} (i.e., the set of compact subset, see figH]) 
a{T) = {Bf ] , , ■ ■ ■ ,B]P,---B$}, where £f = R® - Kf\ and it is comprised by 
dispersive points ( 1 < i < m, see figH]) 

void Subroutinel ( T ,m,£,C {0 \p^{T),a(T)) 

{ 

Stepl. Initialization: Let iteration number t = 0. Let Counter = m. Let y?(T) = </> and 
cr(T) = 0, where denotes empty set. According to initial centroids set C^°\ generate initial 
partition of training set (p^ = {R^ \ R^ C T, 1 < i < m}. 

Step2. While {Counter > 0) { 

Step2.1. Generate new centroids set C^ t+1 ^ = {cf +1 ^ | 1 < i < m)} and new partition 
<^(*+l) = {Rf + V | 1 < z < m } 

/* Note: Check whether entropy sequence {iY(-R-°' ) ), H(R^), ■ ■ ■ , H(Rf'), ■ ■ ■} is conver- 
gent. If it is convergent, let the convergent marker StableMarker(Rf +1 ^) = True*/ 
Step2.2. For (i = 1; % < Counter; i++) { 

Estimate the entropy of class Rf , i.e., H(Rf +1 ^) = log 2 

If ( ^^'^^ ^ < £ ) { StableMarker{Rf +1) ) = True; } 
else {StableMarker(Rf +1) ) = False} 

} 

/*Note: Extract the data around the centroid of class as a genuine class */ 
Step2.3. For (i = 1; i < Counter; i++) { 
If (StableMarker(Rf +l) ) = True) { 
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Calculate compact central region K\ according to formula ([3]) 

Calculate B®: B® = Rf +1) - K® 

Let cp(T) = <p(T) U K® 

Let a(T) = a{T) U B® 

Update Training Set: T = T — R® 

Update centroids set: C^ = - {cf +1) } 

Counter = Counter — 1 

} 

} 

t = t+l. 

} 

} 



C. Special LC Algorithm to Generate Compact Classes (SLC) 

Note that above subroutine 1 is not a partition of training set. Subroutine 1 extracts only 
compact central regions of all classes and the residual points are unclassified. The residual 
points comprise a new training set. And it's possible that some of residual points cluster 
together tightly and comprise some small compact subsets again. These small compact 
subsets are new classes. To obtain these new classes and classify all points, SLC algorithm 
is described as below. 

Input parameters: 

T : Training Set 

wig: The initial number of classes, 
e: The stop threshold for clustering. 
Output: 

Num: The final number of classes. 

CLS: The partition of Tq, in which each class is compact. 
SLC Algorithm: 

Stepl. Initialization: Let T = T , m = m , CLS = (f), and p = 0. 

Step2. For (i=0;i < [log 2 m]; i++) /*Note: [log 2 m] denotes the integer*/ 

{ Step2.1 Generate initial centroids set C^ = {cf | 1 < % < m)}. 
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Step2.2 Call Subroutinel ( T,m,e,C^ ,p,<p(T),a(T)) 
Step2.3 CLS = CLSU<p(T); 
Step2.4 T = a(T); 

/*Note: Increase p to get smaller compact class*/ 
Step2.5m= [f};p = p+l; 

} 

Step3 Every residual point x in the last set o~(T) is regarded as a class {x}. And let CLS = 
CLS U {x}. Let Num denote the number of classes contained in CLS. The two outputs are 
CLS and Num. 

D. The Clustering for Mixture Distribution (SLC-Mixture) 

The clustering algorithm SLC presented above generates spherical classes only. However, 
for a general distribution, some classes are spherical shape, some classes are chain shape in 
which points cluster closely around a curve (or a line), even some classes contain isolated 
points. This common distribution is called mixture distribution. For a large-scale TSP, 
the distribution of cities is mixture distribution in general. The clustering method for 
mixture distribution is proposed as below. 

1. The Simple Maker to Distinguish Spherical Class from Chain-Shaped Class 

The position of city on a map is two-dimensional point. A given class can be divided into 
8 areas along the 4 directions of the north-south, west-east and two diagonal lines through 
the centroid of the class. If the class is spherical, the percentage of points in each area is 
close to 1/8 and is the same approximately. If the class is chain-shaped class (or part of 
chain-shaped class), it's impossible that the percentage of every area is close to 1/8 at the 
same time. Therefore, the percentage of points in each area is the maker of spherical class. 
FigfS] illustrates the marker. 
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FIG. 2: The Illustration of Distinguishing Spherical Class from Chain-shaped Class: A given class 
is divide into 8 parts along the 8 lines through the centroid of the class. If the class is spherical, 
the percentage of each part is close to threshold e = 1/8. If the class is chain-shaped class (or 
part of chain-shaped class), there are 2-4 parts which percentage is far less than 1/8. Therefore, 
the percentage of each part is the marker of spherical class, e = 0.058 in this paper because some 
classes are elliptical. 

2. Applying SLC to Process Mixture Distribution (SLC- Mixture) 

At first, apply SLC to classify all data of training set. Secondly, apply the marker 
presented above to distinguish spherical classes and extract them from the training set. 
Then all residual points comprise a new set named residual set. The residual set contains 
only chain-shaped classes and isolated points. Third, apply the method presented in Ref. [3] 
to classify all residual points of residual set into different chain-shaped classes or marked 
as isolated points. The method presented in Ref. [3] is named Chain-Shaped Clustering 
Algorithm and is introduced at appendix I. 

The clustering method presented in this section is called SLC-Mixture algorithm, which 
processes the mixture distribution of spherical classes, chain-shaped classes and isolated 
points. 
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III. APPLY SLC TO ACO 



A. The Termination Criterion of ACO 

Suppose ACO acts on a compact class and let L t denotes the minimum route 
length that is generated at the t — th iteration of computation. There are sequence 
{Li,L 2 , ••• , L t , L t+1 , ■ • • } and it is convergent under idea condition. The convergent cri- 
terion \ Lt ~Lt+\\ < e j s proposed as the termination criterion of ACO in this paper. 

In the following discussion, ACO refers to the algorithm which termination criterion is 
(\L t -L t+1 \/L t ) <e. 

B. Apply SLC to Improve The Running Speed of ACO (ACO-SLC) 

In this section, the clustering algorithm SLC will be applied to improve the running speed 
of ACO. The method is named ACO-SLC and it is described as below. 
Input Parameter: 
T: Set of cities 

Output: The shortest TSP route obtained by the algorithm 
ACO-SLC Algorithm: 

Stepl. Apply SLC algorithm to partition set T. The classes are B\, B2, ■ ■ ■ , Bi, ■ ■ ■ B^um, and 
their centroids are b±, 62, • • • ■ ■ ■ &7v nm respectively. 

Stpe2. Construct graph G : Centroids b\, &2> • ' ' > • • • b^ um are regarded as virtual cities 
respectively, and the virtual cities are regarded as the vertices of graph G . For a pair of classes B^ 
and Bj, if there exists two cities that belong to Bi and Bj respectively and they joint each other, 
use an edge to joint the two corresponding vertices bi and bj. The weight of edge is the minimum 
distance between two classes, i.e., 

d(B v Bj) = min {d(x v Xj) \ x^Bi, x^Bj} (4) 

Step3. Calculate a TSP route of graph G' to generate the traveling order of all classes: Let ACO 
algorithm act on graph G to find a TSP route denoted by bj 1 , bj 2 , • • • bj Num , where j± , J2 , • • • jNum 1 
is a permutation of sequence 1, 2, • • • Num. The pair of classes Bj i and Bj.. +1) is called neighbor 
class. 
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Step4. Choose an edge as the bridge to joint a pair of neighbor classes, and this edge is named 
bridge edge: Assume that the two neighbor classes are Bj 1 and Bj 2 . If there exist an edge such 
that 

d(x u ,x v ) = min {d(a, b) \ aeB h ,beB h } (5) 

, edge (x u ,x v ) is the bridge edge, x u and x v are called border cities, where vertices a and b 
should be not used to joint other neighbor classes. 

Step5. Calculate a local TSP route for every class -Bj(l < i < Num): Add a new edge 
to joint the two border cities in the class, and mark the edge as necessary edge of the local 
TSP route. This edge is named pseudo-edge. Let the ACO algorithm with convergence criterion 
(\L t — L t+ i \ I ' L t ) < e act on the class to generate a local TSP route. 

Step6. Construct a TSP route: Walk along the traveling order obtained at step3, for every pair 
of neighbor classes, delete the pseudo-edge of each class such that the local route is not close. Then 
let the local route of each class and the bridge edge between these two classes be jointed. 

Figj3] illustrates the processing of ACO-SLC algorithm. 




FIG. 3: The schematic diagram of ACO-SLC : Firstly, classify all points into compact classes. 
Secondly, the centroid of each class is regarded as a virtual city, and calculate a virtual TSP route. 
Then along the virtual route, joint all classes. Thirdly, let ACO act on each class to get a local 
TSP route. Fourthly, joint all local TSP routes along the virtual route to form the last route. 
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C. Using the Method of Little-window and Removing Cross-edge to Improve 
ACO-SLC (ACO-SLC-LWCR) 



Clustering may cause the error of solution although it improves the running speed of ACO 
heavily. If all classes are compact and separated clearly, the quality of solution of ACO-SLC 
should be very good. However, in fact, the border between two neighbor classes is fuzzy. The 
fuzzy border will cause the inaccuracy of solution, and much longer route will appear. And 
recognizing the longer part and removing it will generate better solution possibly. It is well 
known that, the shortest route is always at the surface of a convex hull. Thus, the longer 
part should be at the inner of a convex hull and two longer edges intersect. In other words, 
intersection of two edges is a marker of longer part of a route possibly. According to the 
marker, removing longer edges is call removing cross-edge or removing intersection 



12( | . before execute ACO, the 



edges, which is similar to the method in [121 ] . (Notice: in Ref. 
long and crossed edges are removed to improve the running speed of ACO, not to improve the 
solution quality.) 

FigJH illustrates the method of removing cross-edge. 




FIG. 4: The Illustration of Removing Cross-edges from TSP Route: At the left figure, 
AB and CD intersect each other. There is a principle that shortest route is at the surface of a 
convex hull. Thus, edge AB and CD is the longer part of route and should be removed. Removing 
these two edges will generate shorter route (see right figure). 

In addition, a simple method named little-window strategy is proposed to improve the 



running speed of ACO in Ref.[42|]. Construct a set Si that is comprised by w accessible and 
short edges which join the i — th city, where w is a pre-assigned constant. The ant which 
has arrived at % — th city will select an edge from window set Si only to arrive its next city, 
and not selection an edge from all neighbor edges of this vertex. So, this method improve 
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the running speed of ACO. In addition, the parameter configuration of the method is put 
at appendix II. 

The ACO-SLC with little-window strategy and cross-edges removing is called ACO- 
SLC-LWCR. 



D. The ACO-SLC for Mixture Distribution (ACO-SLC-Mixture) 

ACO-SLC is suitable for the spherical shape distribution only, and the low quality of 
solution will appear possibly when ACO-SLC is applied to process mixture distribution. To 
process mixture distribution, the following method named ACO-SLC-Mixture is proposed 
in this paper. 

Firstly, apply SLC-Mixture at section 2.4.2 to partition the set of cities into spherical 
classes, chain-shaped classes, or isolated points. Secondly, apply ACO-SLC-LWCR to each 
class and generate a TSP route. 



IV. SIMULATION 



In this section, five related algorithms ACO, ACO-K-Means, ACO-SLC, ACO-SLC- 
LWCR, ACO-SLC-Mixture are tested and compared. In the following simulation, ACO 
refers to Ant-cycle presented by Dorigo, which is very typical Q]. 



All test data in this paper is downloaded from http://www.iwr uniheidelberg.de 
/iwr/comopt/soft /TSPLIB95/TSPLIB.html. All algorithms in this paper run on personal 
computer. CPU: 1.80GHz. Memory: 480M. Software: Matlab. The parameters are listed 
as below. Initialize pheromone trails Tjj(O) = 1, iteration number 1000, e = 0.001, a — 1, 
P — 10, p — 0.4, Q = 300, m = [j^]- Two performance items are tested. One item is 
the running time, which is defined as Ratio = Time(ACO) /Time(Algorithm) . The bigger 
the ratio is, the faster the algorithm is. The other item is the quality of solution, which 
is defined as the percentage of error Error = (Solution — Optimum) /Optimum, where 
Optimum denotes the best solution known currently. The smaller the error is, the better 
the quality of solution is. 

The performances of the five algorithms are listed in Figj5l It shows that ACO-SLC, 
ACO-SLC-LWCR and ACO-SLC-Mixture are faster than ACO by 415~ 10736, 390~ 10192 
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and 257-9419 of factors respectively! However, some solutions of ACO-K-Means and ACO- 
SLC have low quality. The inaccuracy ratio of ACO-SLC-Mixture is less than ACO in most 
cases, and is bigger than ACO by 2% at most. 



The Ratio of Running Speed 
Ratio=Time(ACO)/Time(Algorithm) 
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FIG. 5: The Performance Comparison of The Five Algorithms of ACO, ACO-K-Means, ACO-SLC 
ACO-SLC-LWCR and ACO-SLC-Mixture: The figure shows that ACO-SLC algorithm, ACO-SLC, 
ACO-SLC-LWCR and ACO-SLC-Mixture are faster than ACO by 415-10736, 390~10192 and 257- 
9419 of factors respectively! However, some solutions of ACO-K-Means and ACO-SLC have low 
quality. ACO-SLC-Mixture can process mixture distribution and its inaccuracy ratio is less than 
ACO in most cases, and is bigger than ACO by 2% at most. 



The Defect of ACO-SLC: The authors have done many simulations under different 
conditions. And these simulations show the quality of ACO-SLC solution depends on the 
quality of clustering and clustering quality of SLC is sensitive to the initial centroids just 
liking K-Mean algorithm. This is main defect of ACO-SLC. All cities are two-dimensional 
(or three-dimensional) points and they are visual for general TSP problems. Hence, reason- 
able initial centroids can be selected by researcher's eyes to reduce the sensitivity of initial 
centroids. The initial centroids of the simulations of this paper is selected by the authors' 
eyes, and they are put at appendix III of this paper. 



V. CONCLUSION 



Time Complexity of ACO: ACO is the algorithm that inspired by the foraging be- 
havior of ant colonies and has be applied to solve many optimization problem. The typical 
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application of ACO is the application at Traveling Salesman Problem (TSP). The running 
time of ACO is 0(t max MiV 2 ), where t max , M and N denotes the iteration number, number 
of ants, and number of cities respectively. Parameter m is a experiential value and is set 
to [jj.] in general. Parameter N is the key factor of running time because running time is 
proportional to its square. Parameter t max and N are available, and decreseaing parameter 
^max and N will cut down running time. 

Focus of ACO Study: ACO can generate solution with high quality in general. But 
its shortage is that running time is too long. Cutting down running time is one of study 
focuses of ACO, and one way is to decrease parameter t max and N, especially N. 

Basic Idea for This Study Focus: For this study focus, the following basic idea is 
presented in this paper. 

Firstly, all cities are classified into compact classes, where compact class is the class that 
all cities in this class cluster tightly in a small region. 

Secondly, let ACO act on every class to get a local TSP route. 

Thirdly, all local TSP routes are jointed to form solution. 

Fourthly, the inaccuracy of solution caused by clustering is eliminated. 

Realization of Basic Idea: The realization of above idea is based on a novel clustering 
algorithm presented in this paper, which is named Special Local Clustering algorithm (SLC). 
The running time of SLC is far less than the time of ACO. SLC generates compact classes, 
while current popular clustering algorithm such as K-Means does not generate compact 
classes in general. The compactness of class makes the length of TSP route L t at every 
iteration convergent, the convergence of L t (i.e., (\L t — L t +i \ / L t ) — > 0) is proposed as the 
termination criterion of ACO in this paper. Thus, parameter t max is cut down to improve 
the running speed of ACO. In addition, every class has small size, ACO acting on small 
class makes parameter N is cut down, and running speed is improved. According to this 
analysis, ACO-SLC algorithm is presented in this paper. Simulation shows that ACO-SLC 
is faster than ACO by 415~10736 of factors! 

Elimination of the Solution Inaccuracy Caused by Clustering: Although the 
running speed is improved in this paper, the inaccuracy of solution is heavy. Two factors 
causing the inaccuracy are found in this paper. One is the cross-edges (see section 3.3), and 
the other factor is the unmatch between ACO-SLC and mixture distribution (see section 
3.4). According to these two factors, ACO-SLC- LWCR and ACO-SLC- Mixture are presented 
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in this paper, which is the improvement of ACO-SLC. Simulation shows that ACO-SLC- 
LWCR and ACO-SLC-Mixture is faster than ACO by 390~ 10192 and 257-9419 of factors 
respectively! The inaccuracy ratio of ACO-SLC-Mixture is less than ACO in most cases, 
and is bigger than ACO by 2% at most. 
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VII. APPENDIX 



Appendix I: Chain-Shaped Clustering Algorithm of Ref. 19] 



This algorithm is used to process the data set which distribution is chain-shaped only, and is introduced as below. 



19 



Stepl. Unify all data and calculate the centroid of data set, and select the point that is farthest from the centroid as the seed of the first 



Step2. Select the points that are close to the seed into the first class as many as possible until the trace of the first set is bigger than the 
prc-assigncd threshold (Notice: The set of random data has covariancc matrix. The trace of covariancc matrix is proportion to the size of region 
containing these random data. The smaller the trace is, the smaller the size is. Thus, if the trace is less than a pre-assigncd threshold, the class 
is still small and compact. The threshold is 0.0005 in this paper). 

Repeating the two steps, the first class, second class, 3rd class, and so on, will be obtained until all points are classified. Finally, merge all 
neighbor classes as a new big class. E.g., if class A is close to B and B close to C the three classes A, B and C are neighbors and arc merged as a 
big chain-shaped class. 

Appendix II: The Parameter Configuration of Little- Window Method 

The configuration of parameter w in little-window strategy: Let n denotes the number of neighbor cities, If n < 21, let w— min(n-l, 8); Else 
if n < 101, let w=min(n-l, 9); Else if n < 144, let w=min(n-l, 13) ; Else if n < 1000, let w=min(n-l, 19); Else ifn < 4000, let w=min(n-l, 100); 
Else let w = [n/10]; 

Appendix III: Initial Centroids of Classes 

The initial centroids of classes which is used by clustering algorithm SLC: Each initial centroids has coordinate (x, y) 
X= 5000 5000 5000 5000 12000 12000 12000 12000 

Prl36 

y= 3000 5000 8000 10000 10000 8000 5000 3000 

X= 550 4800 4800 2000 4000 12000 12500 10000 12500 16500 13000 8800 

Pr226 

y= 2000 7000 9000 9000 12000 12000 8500 8500 7000 3500 2000 3500 

X= 50 90 90 40 40 150 300 200 270 200 260 400 400 450 550 400 560 700 700 700 

Chl30 

y= 50 190 340 450 680 100 50 250 400 540 650 50 150 300 200 540 400 200 380 550 
X= 500 500 500 500 1400 1400 1400 1100 2300 2300 2300 2000 3000 3000 3000 3000 

Lin318 

y= 500 1500 3100 4000 500 1800 3200 4000 500 1700 3100 4000 500 1500 2700 3600 

X= 3700 3670 3830 3800 3700 4000 2100 1800 1600 1300 950 700 1500 1160 1500 1600 2100 

D198 

y= 1400 1400 1400 1150 1050 1050 1870 1600 1480 1400 1350 1000 1000 1170 1190 1400 1400 

X= 1070 1070 1070 1500 1070 1070 1070 1070 2000 2500 2000 3500 4300 4500 5840 5840 5840 5840 5840 5840 

P654 

y= 2400 2200 2710 3600 4800 4500 5050 5200 2500 4000 5000 3500 3500 2400 2200 2710 2700 4900 4600 5300 
X= 8500 8500 8500 8500 8500 8500 8500 8500 8500 16000 16000 16000 16000 16000 16000 16000 16000 16000 

Prl07 

y= 4800 5500 6400 7100 8000 8900 9800 10600 11500 4800 5500 6400 7100 8000 8900 9800 10600 11500 
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